Overview

Dataset statistics

Number of variables16
Number of observations9023
Missing cells2527
Missing cells (%)1.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.0 MiB
Average record size in memory345.8 B

Variable types

NUM10
CAT6

Reproduction

Analysis started2020-03-01 12:52:23.453889
Analysis finished2020-03-01 12:53:29.419453
Versionpandas-profiling v2.5.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml
name has a high cardinality: 8869 distinct values High cardinality
host_name has a high cardinality: 2423 distinct values High cardinality
neighbourhood has a high cardinality: 89 distinct values High cardinality
last_review has a high cardinality: 987 distinct values High cardinality
neighbourhood is highly correlated with neighbourhood_groupHigh Correlation
neighbourhood_group is highly correlated with neighbourhoodHigh Correlation
last_review has 1261 (14.0%) missing values Missing
reviews_per_month has 1261 (14.0%) missing values Missing
number_of_reviews has 1261 (14.0%) zeros Zeros
availability_365 has 2054 (22.8%) zeros Zeros

Variables

id
Real number (ℝ≥0)

UNIQUE
Distinct count9023
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21626150.66164247
Minimum2318
Maximum40261634
Zeros0
Zeros (%)0.0%
Memory size70.6 KiB

Quantile statistics

Minimum2318
5-th percentile2516840.9
Q113205937.5
median21804258
Q331882021
95-th percentile38502364.6
Maximum40261634
Range40259316
Interquartile range (IQR)18676083.5

Descriptive statistics

Standard deviation11201265.03
Coefficient of variation (CV)0.5179500135
Kurtosis-1.033035632
Mean21626150.66
Median Absolute Deviation (MAD)9403002.647
Skewness-0.1574237174
Sum1.951327574e+11
Variance1.254683382e+14
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[2.31800000e+03 2.87535000e+04 1.52047500e+06 1.52056500e+06 6.36137900e+06 ... 3.93906090e+07 3.93996065e+07 3.99489895e+07 3.99500735e+07 4.02616340e+07], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
7178239 1 < 0.1%
 
3818746 1 < 0.1%
 
7089415 1 < 0.1%
 
14011651 1 < 0.1%
 
38213555 1 < 0.1%
 
22818047 1 < 0.1%
 
35120024 1 < 0.1%
 
4439293 1 < 0.1%
 
18390265 1 < 0.1%
 
18291977 1 < 0.1%
 
Other values (9013) 9013 99.9%
 
ValueCountFrequency (%) 
2318 1 < 0.1%
 
5682 1 < 0.1%
 
6606 1 < 0.1%
 
9419 1 < 0.1%
 
9460 1 < 0.1%
 
ValueCountFrequency (%) 
40261634 1 < 0.1%
 
40197071 1 < 0.1%
 
40183377 1 < 0.1%
 
40183149 1 < 0.1%
 
40176359 1 < 0.1%
 

name
Categorical

HIGH CARDINALITY
UNIFORM
Distinct count8869
Unique (%)98.3%
Missing1
Missing (%)< 0.1%
Memory size35.3 KiB
SoBe Westlake Apartments 2 Bedroom
 
9
Day 1 | Summer Intern Housing | DT Seattle xx
 
7
Downtown Seattle | Summer Rental | Corp Housing xx
 
7
DT Seattle | Summer Rental | 30day special rate xx
 
7
Urban 1 Bedroom Apartment in SLU
 
6
Other values (8864)
8986
ValueCountFrequency (%) 
SoBe Westlake Apartments 2 Bedroom 9 0.1%
 
Day 1 | Summer Intern Housing | DT Seattle xx 7 0.1%
 
Downtown Seattle | Summer Rental | Corp Housing xx 7 0.1%
 
DT Seattle | Summer Rental | 30day special rate xx 7 0.1%
 
Urban 1 Bedroom Apartment in SLU 6 0.1%
 
SoBe Downtown Seattle Apartments 5 0.1%
 
1 Bedroom Apartment in SLU 4 < 0.1%
 
Spacious and Cozy Home 4 < 0.1%
 
Corporate HighRise Apartment on Pine T1 4 < 0.1%
 
Loft Near downtown/Capitol hill 4 < 0.1%
 
Other values (8859) 8965 99.4%
 

Length

Max length136
Mean length38.42535742
Min length2
ValueCountFrequency (%) 
Other_Letter 85 39.5%
 
Lowercase_Letter 27 12.6%
 
Uppercase_Letter 26 12.1%
 
Other_Symbol 20 9.3%
 
Other_Punctuation 15 7.0%
 
Decimal_Number 10 4.7%
 
Math_Symbol 6 2.8%
 
Final_Punctuation 3 1.4%
 
Dash_Punctuation 3 1.4%
 
Close_Punctuation 3 1.4%
 
Other values (10) 17 7.9%
 
ValueCountFrequency (%) 
Common 72 33.5%
 
Han 70 32.6%
 
Latin 54 25.1%
 
Hangul 9 4.2%
 
Devanagari 8 3.7%
 
Inherited 2 0.9%
 
ValueCountFrequency (%) 
ASCII 90 44.1%
 
CJK 70 34.3%
 
Hangul 9 4.4%
 
Dingbats 9 4.4%
 
Misc Symbols 9 4.4%
 
Punctuation 8 3.9%
 
Devanagari 8 3.9%
 
VS 1 0.5%
 

host_id
Real number (ℝ≥0)

Distinct count5233
Unique (%)58.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean65151068.86312756
Minimum20
Maximum310961317
Zeros0
Zeros (%)0.0%
Memory size70.6 KiB

Quantile statistics

Minimum20
5-th percentile890201.9
Q18534462
median32307630
Q389449231.5
95-th percentile253517662.7
Maximum310961317
Range310961297
Interquartile range (IQR)80914769.5

Descriptive statistics

Standard deviation77948164.05
Coefficient of variation (CV)1.196421876
Kurtosis1.080937619
Mean65151068.86
Median Absolute Deviation (MAD)60831791.15
Skewness1.457367062
Sum5.878580944e+11
Variance6.075916279e+15
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[2.00000000e+01 3.05815000e+04 3.35985000e+04 7.38320000e+04 7.68775000e+04 ... 2.85003798e+08 2.85305949e+08 2.94072704e+08 2.94665106e+08 3.10961317e+08], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
8534462 346 3.8%
 
48005494 237 2.6%
 
50550045 152 1.7%
 
82961680 138 1.5%
 
229095817 117 1.3%
 
4962900 92 1.0%
 
114353388 91 1.0%
 
1243056 58 0.6%
 
74305 58 0.6%
 
222592495 52 0.6%
 
Other values (5223) 7682 85.1%
 
ValueCountFrequency (%) 
20 1 < 0.1%
 
862 1 < 0.1%
 
1877 1 < 0.1%
 
2536 2 < 0.1%
 
4193 4 < 0.1%
 
ValueCountFrequency (%) 
310961317 1 < 0.1%
 
309783739 2 < 0.1%
 
309771614 1 < 0.1%
 
309387078 2 < 0.1%
 
309006739 1 < 0.1%
 

host_name
Categorical

HIGH CARDINALITY
Distinct count2423
Unique (%)26.9%
Missing4
Missing (%)< 0.1%
Memory size35.3 KiB
Corp Condos & Apts
 
346
Zeus
 
237
Stay Alfred
 
183
Day 1
 
152
Addison
 
138
Other values (2418)
7963
ValueCountFrequency (%) 
Corp Condos & Apts 346 3.8%
 
Zeus 237 2.6%
 
Stay Alfred 183 2.0%
 
Day 1 152 1.7%
 
Addison 138 1.5%
 
Loftium 117 1.3%
 
David 75 0.8%
 
Melissa 74 0.8%
 
Michael 69 0.8%
 
Andrew 68 0.8%
 
Other values (2413) 7560 83.8%
 

Length

Max length34
Mean length7.163249474
Min length1
ValueCountFrequency (%) 
Lowercase_Letter 28 41.2%
 
Uppercase_Letter 26 38.2%
 
Other_Punctuation 6 8.8%
 
Decimal_Number 2 2.9%
 
Math_Symbol 2 2.9%
 
Space_Separator 1 1.5%
 
Dash_Punctuation 1 1.5%
 
Open_Punctuation 1 1.5%
 
Close_Punctuation 1 1.5%
 
ValueCountFrequency (%) 
Latin 54 79.4%
 
Common 14 20.6%
 
ValueCountFrequency (%) 
ASCII 66 100.0%
 

neighbourhood_group
Categorical

HIGH CORRELATION
Distinct count17
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size35.3 KiB
Downtown
1748
Other neighborhoods
1636
Capitol Hill
916
Central Area
782
Queen Anne
 
647
Other values (12)
3294
ValueCountFrequency (%) 
Downtown 1748 19.4%
 
Other neighborhoods 1636 18.1%
 
Capitol Hill 916 10.2%
 
Central Area 782 8.7%
 
Queen Anne 647 7.2%
 
West Seattle 483 5.4%
 
Ballard 454 5.0%
 
Rainier Valley 432 4.8%
 
Cascade 419 4.6%
 
Beacon Hill 323 3.6%
 
Other values (7) 1183 13.1%
 

Length

Max length19
Mean length11.7837748
Min length7
ValueCountFrequency (%) 
Lowercase_Letter 20 52.6%
 
Uppercase_Letter 17 44.7%
 
Space_Separator 1 2.6%
 
ValueCountFrequency (%) 
Latin 37 97.4%
 
Common 1 2.6%
 
ValueCountFrequency (%) 
ASCII 38 100.0%
 

neighbourhood
Categorical

HIGH CARDINALITY
HIGH CORRELATION
Distinct count89
Unique (%)1.0%
Missing0
Missing (%)0.0%
Memory size35.3 KiB
Broadway
 
543
Belltown
 
540
Central Business District
 
395
Wallingford
 
325
First Hill
 
316
Other values (84)
6904
ValueCountFrequency (%) 
Broadway 543 6.0%
 
Belltown 540 6.0%
 
Central Business District 395 4.4%
 
Wallingford 325 3.6%
 
First Hill 316 3.5%
 
Minor 291 3.2%
 
Fremont 269 3.0%
 
University District 256 2.8%
 
South Lake Union 248 2.7%
 
Pike-Market 245 2.7%
 
Other values (79) 5595 62.0%
 

Length

Max length25
Mean length11.4717943
Min length4
ValueCountFrequency (%) 
Lowercase_Letter 24 49.0%
 
Uppercase_Letter 22 44.9%
 
Dash_Punctuation 1 2.0%
 
Space_Separator 1 2.0%
 
Other_Punctuation 1 2.0%
 
ValueCountFrequency (%) 
Latin 46 93.9%
 
Common 3 6.1%
 
ValueCountFrequency (%) 
ASCII 49 100.0%
 

latitude
Real number (ℝ≥0)

Distinct count6537
Unique (%)72.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean47.625185596764595
Minimum47.495870000000004
Maximum47.73395
Zeros0
Zeros (%)0.0%
Memory size70.6 KiB

Quantile statistics

Minimum47.49587
5-th percentile47.540986
Q147.605535
median47.61984
Q347.659275
95-th percentile47.698086
Maximum47.73395
Range0.23808
Interquartile range (IQR)0.05374

Descriptive statistics

Standard deviation0.04554863553
Coefficient of variation (CV)0.0009563980688
Kurtosis-0.1027247102
Mean47.6251856
Median Absolute Deviation (MAD)0.03539203091
Skewness-0.1868813729
Sum429722.0496
Variance0.002074678199
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[47.49587 47.50955 47.524 47.55026 47.598045 ... 47.650805 47.678215 47.69673 47.705245 47.73395 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
47.61108 8 0.1%
 
47.61003 7 0.1%
 
47.61602 7 0.1%
 
47.61183 7 0.1%
 
47.61146 7 0.1%
 
47.61109 7 0.1%
 
47.61243 7 0.1%
 
47.61172 6 0.1%
 
47.6112 6 0.1%
 
47.61369 6 0.1%
 
Other values (6527) 8955 99.2%
 
ValueCountFrequency (%) 
47.49587 1 < 0.1%
 
47.49656 1 < 0.1%
 
47.49661 1 < 0.1%
 
47.49732 1 < 0.1%
 
47.49741 1 < 0.1%
 
ValueCountFrequency (%) 
47.73395 1 < 0.1%
 
47.73385 1 < 0.1%
 
47.73369 1 < 0.1%
 
47.73364 1 < 0.1%
 
47.73362 1 < 0.1%
 

longitude
Real number (ℝ)

Distinct count6215
Unique (%)68.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-122.3337326875762
Minimum-122.41925
Maximum-122.24095
Zeros0
Zeros (%)0.0%
Memory size70.6 KiB

Quantile statistics

Minimum-122.41925
5-th percentile-122.389387
Q1-122.353365
median-122.33301
Q3-122.312565
95-th percentile-122.283245
Maximum-122.24095
Range0.1783
Interquartile range (IQR)0.0408

Descriptive statistics

Standard deviation0.0313327359
Coefficient of variation (CV)-0.0002561250704
Kurtosis-0.2670155781
Mean-122.3337327
Median Absolute Deviation (MAD)0.02504650342
Skewness-0.1261254543
Sum-1103817.27
Variance0.000981740339
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-122.41925 -122.415155 -122.402825 -122.3905 -122.362995 ... -122.286195 -122.274545 -122.26429 -122.25913 -122.24095 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
-122.33635 7 0.1%
 
-122.33786 7 0.1%
 
-122.33917 6 0.1%
 
-122.34175 6 0.1%
 
-122.32378 6 0.1%
 
-122.32793 6 0.1%
 
-122.32889 6 0.1%
 
-122.34229 6 0.1%
 
-122.3493 5 0.1%
 
-122.33662 5 0.1%
 
Other values (6205) 8963 99.3%
 
ValueCountFrequency (%) 
-122.41925 1 < 0.1%
 
-122.41908 1 < 0.1%
 
-122.41839 1 < 0.1%
 
-122.41791 1 < 0.1%
 
-122.41784 1 < 0.1%
 
ValueCountFrequency (%) 
-122.24095 1 < 0.1%
 
-122.2412 1 < 0.1%
 
-122.24135 1 < 0.1%
 
-122.24204 1 < 0.1%
 
-122.24215 1 < 0.1%
 

room_type
Categorical

Distinct count4
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size35.3 KiB
Entire home/apt
6793
Private room
1908
Shared room
 
174
Hotel room
 
148
ValueCountFrequency (%) 
Entire home/apt 6793 75.3%
 
Private room 1908 21.1%
 
Shared room 174 1.9%
 
Hotel room 148 1.6%
 

Length

Max length15
Mean length14.20647235
Min length10
ValueCountFrequency (%) 
Lowercase_Letter 13 68.4%
 
Uppercase_Letter 4 21.1%
 
Space_Separator 1 5.3%
 
Other_Punctuation 1 5.3%
 
ValueCountFrequency (%) 
Latin 17 89.5%
 
Common 2 10.5%
 
ValueCountFrequency (%) 
ASCII 19 100.0%
 

price
Real number (ℝ≥0)

Distinct count400
Unique (%)4.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean170.37138424027486
Minimum0
Maximum9999
Zeros2
Zeros (%)< 0.1%
Memory size70.6 KiB

Quantile statistics

Minimum0
5-th percentile44
Q180
median119
Q3186.5
95-th percentile450
Maximum9999
Range9999
Interquartile range (IQR)106.5

Descriptive statistics

Standard deviation220.6638496
Coefficient of variation (CV)1.295193149
Kurtosis513.260198
Mean170.3713842
Median Absolute Deviation (MAD)104.3783858
Skewness14.89515356
Sum1537261
Variance48692.53453
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 12.5 29.5 30.5 34.5 ... 765.5 987. 1008.5 1575. 9999. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
100 333 3.7%
 
150 326 3.6%
 
75 233 2.6%
 
125 232 2.6%
 
99 216 2.4%
 
90 206 2.3%
 
120 196 2.2%
 
200 192 2.1%
 
80 189 2.1%
 
85 171 1.9%
 
Other values (390) 6729 74.6%
 
ValueCountFrequency (%) 
0 2 < 0.1%
 
10 11 0.1%
 
15 26 0.3%
 
17 1 < 0.1%
 
18 20 0.2%
 
ValueCountFrequency (%) 
9999 1 < 0.1%
 
5400 1 < 0.1%
 
5000 1 < 0.1%
 
4000 1 < 0.1%
 
3000 1 < 0.1%
 

minimum_nights
Real number (ℝ≥0)

Distinct count47
Unique (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.052643245040453
Minimum1
Maximum400
Zeros0
Zeros (%)0.0%
Memory size70.6 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q33
95-th percentile30
Maximum400
Range399
Interquartile range (IQR)2

Descriptive statistics

Standard deviation14.73778795
Coefficient of variation (CV)2.91684713
Kurtosis297.289298
Mean5.052643245
Median Absolute Deviation (MAD)5.635676895
Skewness14.14179398
Sum45590
Variance217.2023935
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 1.5 2.5 3.5 5.5 ... 29.5 30.5 31.5 181. 400. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2 3466 38.4%
 
1 3077 34.1%
 
3 1032 11.4%
 
30 647 7.2%
 
4 235 2.6%
 
5 175 1.9%
 
7 132 1.5%
 
6 48 0.5%
 
14 33 0.4%
 
10 31 0.3%
 
Other values (37) 147 1.6%
 
ValueCountFrequency (%) 
1 3077 34.1%
 
2 3466 38.4%
 
3 1032 11.4%
 
4 235 2.6%
 
5 175 1.9%
 
ValueCountFrequency (%) 
400 1 < 0.1%
 
365 3 < 0.1%
 
360 1 < 0.1%
 
345 1 < 0.1%
 
330 1 < 0.1%
 

number_of_reviews
Real number (ℝ≥0)

ZEROS
Distinct count408
Unique (%)4.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.344231408622406
Minimum0
Maximum795
Zeros1261
Zeros (%)14.0%
Memory size70.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q13
median18
Q366
95-th percentile206
Maximum795
Range795
Interquartile range (IQR)63

Descriptive statistics

Standard deviation75.89981672
Coefficient of variation (CV)1.507616952
Kurtosis9.928687275
Mean50.34423141
Median Absolute Deviation (MAD)52.63297346
Skewness2.673289324
Sum454256
Variance5760.782179
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.000e+00 5.000e-01 1.500e+00 4.500e+00 7.500e+00 ... 2.735e+02 3.135e+02 4.195e+02 5.545e+02 7.950e+02], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 1261 14.0%
 
1 590 6.5%
 
2 344 3.8%
 
3 305 3.4%
 
4 261 2.9%
 
6 207 2.3%
 
7 176 2.0%
 
5 175 1.9%
 
8 155 1.7%
 
9 143 1.6%
 
Other values (398) 5406 59.9%
 
ValueCountFrequency (%) 
0 1261 14.0%
 
1 590 6.5%
 
2 344 3.8%
 
3 305 3.4%
 
4 261 2.9%
 
ValueCountFrequency (%) 
795 1 < 0.1%
 
778 1 < 0.1%
 
733 1 < 0.1%
 
640 1 < 0.1%
 
569 1 < 0.1%
 

last_review
Categorical

HIGH CARDINALITY
MISSING
Distinct count987
Unique (%)12.7%
Missing1261
Missing (%)14.0%
Memory size35.3 KiB
2019-11-17
 
369
2019-11-11
 
331
2019-11-03
 
264
2019-11-10
 
263
2019-10-20
 
207
Other values (982)
6328
ValueCountFrequency (%) 
2019-11-17 369 4.1%
 
2019-11-11 331 3.7%
 
2019-11-03 264 2.9%
 
2019-11-10 263 2.9%
 
2019-10-20 207 2.3%
 
2019-11-04 205 2.3%
 
2019-11-18 194 2.2%
 
2019-10-27 161 1.8%
 
2019-11-16 126 1.4%
 
2019-10-21 123 1.4%
 
Other values (977) 5519 61.2%
 
(Missing) 1261 14.0%
 

Length

Max length10
Mean length9.021722265
Min length3
ValueCountFrequency (%) 
Decimal_Number 10 76.9%
 
Lowercase_Letter 2 15.4%
 
Dash_Punctuation 1 7.7%
 
ValueCountFrequency (%) 
Common 11 84.6%
 
Latin 2 15.4%
 
ValueCountFrequency (%) 
ASCII 13 100.0%
 

reviews_per_month
Real number (ℝ≥0)

MISSING
Distinct count897
Unique (%)11.6%
Missing1261
Missing (%)14.0%
Infinite0
Infinite (%)0.0%
Mean2.3141084771965983
Minimum0.01
Maximum14.8
Zeros0
Zeros (%)0.0%
Memory size70.6 KiB

Quantile statistics

Minimum0.01
5-th percentile0.08
Q10.48
median1.6
Q33.61
95-th percentile6.7995
Maximum14.8
Range14.79
Interquartile range (IQR)3.13

Descriptive statistics

Standard deviation2.242515455
Coefficient of variation (CV)0.9690623743
Kurtosis1.304659226
Mean2.314108477
Median Absolute Deviation (MAD)1.807217491
Skewness1.233886034
Sum17962.11
Variance5.028875568
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1 115 1.3%
 
0.04 76 0.8%
 
0.06 61 0.7%
 
0.07 59 0.7%
 
0.05 59 0.7%
 
0.09 58 0.6%
 
0.11 55 0.6%
 
0.15 54 0.6%
 
0.02 54 0.6%
 
0.19 53 0.6%
 
Other values (887) 7118 78.9%
 
(Missing) 1261 14.0%
 
ValueCountFrequency (%) 
0.01 6 0.1%
 
0.02 54 0.6%
 
0.03 48 0.5%
 
0.04 76 0.8%
 
0.05 59 0.7%
 
ValueCountFrequency (%) 
14.8 1 < 0.1%
 
14.26 1 < 0.1%
 
14.19 1 < 0.1%
 
14.08 1 < 0.1%
 
13.33 1 < 0.1%
 

calculated_host_listings_count
Real number (ℝ≥0)

Distinct count37
Unique (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32.48199046880195
Minimum1
Maximum346
Zeros0
Zeros (%)0.0%
Memory size70.6 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q38
95-th percentile237
Maximum346
Range345
Interquartile range (IQR)7

Descriptive statistics

Standard deviation78.64723792
Coefficient of variation (CV)2.421256727
Kurtosis8.073467018
Mean32.48199047
Median Absolute Deviation (MAD)48.23541032
Skewness2.967937978
Sum293085
Variance6185.388032
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1. 1.5 2.5 3.5 5.5 ... 55. 127.5 145. 291.5 346. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1 4244 47.0%
 
2 1170 13.0%
 
3 576 6.4%
 
346 346 3.8%
 
237 237 2.6%
 
4 232 2.6%
 
5 185 2.1%
 
6 156 1.7%
 
152 152 1.7%
 
138 138 1.5%
 
Other values (27) 1587 17.6%
 
ValueCountFrequency (%) 
1 4244 47.0%
 
2 1170 13.0%
 
3 576 6.4%
 
4 232 2.6%
 
5 185 2.1%
 
ValueCountFrequency (%) 
346 346 3.8%
 
237 237 2.6%
 
152 152 1.7%
 
138 138 1.5%
 
117 117 1.3%
 

availability_365
Real number (ℝ≥0)

ZEROS
Distinct count365
Unique (%)4.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean139.20691565998004
Minimum0
Maximum365
Zeros2054
Zeros (%)22.8%
Memory size70.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q17
median90
Q3270
95-th percentile362
Maximum365
Range365
Interquartile range (IQR)263

Descriptive statistics

Standard deviation133.1438122
Coefficient of variation (CV)0.9564453864
Kurtosis-1.26810398
Mean139.2069157
Median Absolute Deviation (MAD)117.4576913
Skewness0.5133156261
Sum1256064
Variance17727.27474
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 17.5 18.5 40.5 ... 354.5 362.5 363.5 364.5 365. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 2054 22.8%
 
365 238 2.6%
 
364 134 1.5%
 
90 91 1.0%
 
41 80 0.9%
 
363 77 0.9%
 
180 65 0.7%
 
49 64 0.7%
 
324 62 0.7%
 
18 54 0.6%
 
Other values (355) 6104 67.6%
 
ValueCountFrequency (%) 
0 2054 22.8%
 
1 40 0.4%
 
2 29 0.3%
 
3 31 0.3%
 
4 29 0.3%
 
ValueCountFrequency (%) 
365 238 2.6%
 
364 134 1.5%
 
363 77 0.9%
 
362 52 0.6%
 
361 39 0.4%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Missing values

Sample

First rows

idnamehost_idhost_nameneighbourhood_groupneighbourhoodlatitudelongituderoom_typepriceminimum_nightsnumber_of_reviewslast_reviewreviews_per_monthcalculated_host_listings_countavailability_365
02318Casa Madrona - Urban Oasis 1 block from the park!2536MeganCentral AreaMadrona47.61082-122.29082Entire home/apt2967292019-10-310.21259
15682Cozy Studio, min. to downtown -WiFi8993MaddyDelridgeSouth Delridge47.52398-122.35989Entire home/apt4834622018-11-243.9210
26606Fab, private seattle urban cottage!14942JoyceOther neighborhoodsWallingford47.65411-122.33761Entire home/apt9021502019-09-281.19349
39419Glorious sun room w/ memory foambed30559AngielenaOther neighborhoodsGeorgetown47.55062-122.32014Private room6221462019-10-221.298359
49460Downtown Convention Center B&B -- Free Minibar30832SienaDowntownFirst Hill47.61265-122.32936Private room9934552019-11-093.654138
59531The Adorable Sweet Orange Craftsman31481CassieWest SeattleFairmount Park47.55539-122.38474Entire home/apt1653392019-09-200.412336
69534The Coolest Tangerine Dream MIL!31481CassieWest SeattleFairmount Park47.55624-122.38598Entire home/apt1252462019-10-280.482346
79596the down home , spacious, central and fab!14942JoyceOther neighborhoodsWallingford47.65479-122.33652Entire home/apt1202932019-09-220.9130
89909Luna Lower - West Seattle33360LauraWest SeattleFairmount Park47.56521-122.37375Entire home/apt1253732019-10-210.608347
911012the orange house, quiet 'n central14942JoyceOther neighborhoodsWallingford47.65448-122.33646Entire home/apt2992912019-09-010.763177

Last rows

idnamehost_idhost_nameneighbourhood_groupneighbourhoodlatitudelongituderoom_typepriceminimum_nightsnumber_of_reviewslast_reviewreviews_per_monthcalculated_host_listings_countavailability_365
901340156785Enthralling and Comfy 1-BR Apartment in Seattle306367598RoderickDowntownFirst Hill47.60830-122.32818Entire home/apt12010NaNNaN2364
901440159265Seattle Dreamy and Scenic 1 Bedroom Apartment306367598RoderickDowntownFirst Hill47.60912-122.32898Entire home/apt15810NaNNaN2333
901540162643Two Bedroom Downtown Oasis (Parking Included) WS97208530431Seattle Super SuitesDowntownFirst Hill47.61108-122.32895Entire home/apt19930NaNNaN2117
901640174107Cozy Downtown Quarters w/ City Views183583319AlexDowntownBelltown47.61554-122.34561Entire home/apt11520NaNNaN438
901740175430Clean private Rm in comfortable North Seattle home7435040ChengyingNorthgateHaller Lake47.72271-122.33583Private room4010NaNNaN8234
9018401763592 clean private rooms big window in North Seattle7435040ChengyingNorthgateHaller Lake47.72269-122.33539Private room8020NaNNaN8234
901940183149Laurel's House21013086Ron PaulOther neighborhoodsFremont47.65662-122.34548Entire home/apt60300NaNNaN2176
902040183377Entire House *Walker’s Pradise*Good Transit289666185NhatRainier ValleyColumbia City47.56200-122.29087Entire home/apt8910NaNNaN1356
902140197071Steps to Pike Place and Gum Wall226137890XeniaDowntownPike-Market47.60866-122.33936Entire home/apt10710NaNNaN1125
902240261634Seattle home that’s close to everything310961317JeffreyWest SeattleFairmount Park47.54959-122.37772Entire home/apt12010NaNNaN1173